智能论文笔记

Toward smart composites: small-scale, untethered prediction and control for soft sensor/actuator systems

Sarah Aguasvivas Manzano , Vani Sundaram , Artemis Xu , Khoi Ly , Mark Rentschler , Robert Shepherd , Nikolaus Correll

分类：机器人 | 机器学习

2022-05-22

我们提供了公式和开源工具，以使用学识渊博的前动力学和设备计算来实现传感器/执行器系统的内部模型预测控制。微控制器单元（MCUS）在与传感器和执行器共关联时计算预测和控制任务的微控制器单元（MCUS）可以实现内部不受束缚的行为。在这种方法中，小型参数大小神经网络模型离线学习前进运动学。我们的开源编译器NN4MC生成代码以将这些预测卸载到MCUS上。然后，牛顿 - 拉夫森求解器实时计算控件输入。我们首先基准在质量 - 弹簧抑制剂模拟上针对PID控制器的这种非线性控制方法。然后，我们在两个具有不同传感，驱动和计算硬件的实验钻机上研究实验结果：具有嵌入式照明传感器的基于肌腱的平台和带有磁性传感器的基于HASEL的平台。实验结果表明，具有较小的内存足迹（小于或等于闪存的6.4％）的参考路径（大于或等于120 Hz）的有效高带宽跟踪。在基于肌腱的平台中，测得的误差之后路径不超过2mm。在基于HASEL的平台中，模拟路径以下误差不超过1mm。这种方法在ARM Cortex-M4F设备中的平均功耗为45.4 MW。这种控制方法还与Tensorflow Lite模型和等效的在设备代码兼容。内物质智能使一类新的复合材料将自主权注入具有精制人工本体感受的结构和系统。

translated by 谷歌翻译

Application of Machine Learning to Sleep Stage Classification

Andrew Smith , Hardik Anand , Snezana Milosavljevic , Katherine M. Rentschler , Ana Pocivavsek , Homayoun Valafar

分类：机器学习

2021-11-04

睡眠研究必须携带与睡眠损失相关的表型和有助于精神病理学的露出机制。最常见的是，调查人员手动将多色网络分类为警惕状态，这是耗时的，需要广泛的培训，并且容易出现帧间间变异性。虽然许多作品已经基于多个EEG通道成功开发了自动化状态分类器，但是我们的目标是生产一种自动化和开放式分类器，可以基于来自啮齿动物的单个皮质脑电图（EEG）来可靠地预测警惕状态，以最大限度地减少伴随的缺点通过电线束缚小动物到计算机程序。大约427小时的连续监测的脑电图，电灰度（EMG）和活性由总数据的571小时的域专家标记。在这里，我们评估各种机器学习技术对分类10-秒钟时期的各种机器学习技术的性能，进入三个离散类中的一种：矛盾，慢波或唤醒。我们的调查包括决策树，随机森林，天真贝叶斯分类器，Logistic回归分类器和人工神经网络。这些方法达到了约74％至约96％的精度。最值得注意的是，随机森林和巢穴分别实现了95.78％和93.31％的显着准确性。在这里，我们已经示出了各种机器学习分类器的潜力，以基于单个EEG读数和单一EMG读数自动，准确地和可靠地对警惕状态进行自动。

translated by 谷歌翻译

Using Active Learning Methods to Strategically Select Essays for Automated Scoring

Tahereh Firoozi , Hamid Mohammadi , Mark J. Gierl

分类：自然语言处理

2023-01-02

Research on automated essay scoring has become increasing important because it serves as a method for evaluating students' written-responses at scale. Scalable methods for scoring written responses are needed as students migrate to online learning environments resulting in the need to evaluate large numbers of written-response assessments. The purpose of this study is to describe and evaluate three active learning methods than can be used to minimize the number of essays that must be scored by human raters while still providing the data needed to train a modern automated essay scoring system. The three active learning methods are the uncertainty-based, the topological-based, and the hybrid method. These three methods were used to select essays included as part of the Automated Student Assessment Prize competition that were then classified using a scoring model that was training with the bidirectional encoder representations from transformer language model. All three active learning methods produced strong results, with the topological-based method producing the most efficient classification. Growth rate accuracy was also evaluated. The active learning methods produced different levels of efficiency under different sample size allocations but, overall, all three methods were highly efficient and produced classifications that were similar to one another.

translated by 谷歌翻译

Planning Paths through Occlusions in Urban Environments

Yutao Han , Youya Xia , Guo-Jun Qi , Mark Campbell

分类：机器人

2022-12-29

This paper presents a novel framework for planning in unknown and occluded urban spaces. We specifically focus on turns and intersections where occlusions significantly impact navigability. Our approach uses an inpainting model to fill in a sparse, occluded, semantic lidar point cloud and plans dynamically feasible paths for a vehicle to traverse through the open and inpainted spaces. We demonstrate our approach using a car's lidar data with real-time occlusions, and show that by inpainting occluded areas, we can plan longer paths, with more turn options compared to without inpainting; in addition, our approach more closely follows paths derived from a planner with no occlusions (called the ground truth) compared to other state of the art approaches.

translated by 谷歌翻译

Feature Acquisition using Monte Carlo Tree Search

Sungsoo Lim , Diego Klabjan , Mark Shapiro

分类：机器学习

2022-12-21

Feature acquisition algorithms address the problem of acquiring informative features while balancing the costs of acquisition to improve the learning performances of ML models. Previous approaches have focused on calculating the expected utility values of features to determine the acquisition sequences. Other approaches formulated the problem as a Markov Decision Process (MDP) and applied reinforcement learning based algorithms. In comparison to previous approaches, we focus on 1) formulating the feature acquisition problem as a MDP and applying Monte Carlo Tree Search, 2) calculating the intermediary rewards for each acquisition step based on model improvements and acquisition costs and 3) simultaneously optimizing model improvement and acquisition costs with multi-objective Monte Carlo Tree Search. With Proximal Policy Optimization and Deep Q-Network algorithms as benchmark, we show the effectiveness of our proposed approach with experimental study.

translated by 谷歌翻译

Universal versus system-specific features of punctuation usage patterns in~major Western~languages

Tomasz Stanisz , Stanislaw Drozdz , Jaroslaw Kwapien

分类：自然语言处理

2022-12-21

The celebrated proverb that "speech is silver, silence is golden" has a long multinational history and multiple specific meanings. In written texts punctuation can in fact be considered one of its manifestations. Indeed, the virtue of effectively speaking and writing involves - often decisively - the capacity to apply the properly placed breaks. In the present study, based on a large corpus of world-famous and representative literary texts in seven major Western languages, it is shown that the distribution of intervals between consecutive punctuation marks in almost all texts can universally be characterised by only two parameters of the discrete Weibull distribution which can be given an intuitive interpretation in terms of the so-called hazard function. The values of these two parameters tend to be language-specific, however, and even appear to navigate translations. The properties of the computed hazard functions indicate that among the studied languages, English turns out to be the least constrained by the necessity to place a consecutive punctuation mark to partition a sequence of words. This may suggest that when compared to other studied languages, English is more flexible, in the sense of allowing longer uninterrupted sequences of words. Spanish reveals similar tendency to only a bit lesser extent.

translated by 谷歌翻译

The Third International Verification of Neural Networks Competition (VNN-COMP 2022): Summary and Results

Mark Niklas Müller , Christopher Brix , Stanley Bak , Changliu Liu , Taylor T. Johnson

分类：机器学习 | 人工智能

2022-12-20

This report summarizes the 3rd International Verification of Neural Networks Competition (VNN-COMP 2022), held as a part of the 5th Workshop on Formal Methods for ML-Enabled Autonomous Systems (FoMLAS), which was collocated with the 34th International Conference on Computer-Aided Verification (CAV). VNN-COMP is held annually to facilitate the fair and objective comparison of state-of-the-art neural network verification tools, encourage the standardization of tool interfaces, and bring together the neural network verification community. To this end, standardized formats for networks (ONNX) and specification (VNN-LIB) were defined, tools were evaluated on equal-cost hardware (using an automatic evaluation pipeline based on AWS instances), and tool parameters were chosen by the participants before the final test sets were made public. In the 2022 iteration, 11 teams participated on a diverse set of 12 scored benchmarks. This report summarizes the rules, benchmarks, participating tools, results, and lessons learned from this iteration of this competition.

translated by 谷歌翻译

Extrinsic Evaluation of Machine Translation Metrics

Nikita Moghe , Tom Sherborne , Mark Steedman , Alexandra Birch

分类：自然语言处理 | 人工智能

2022-12-20

Automatic machine translation (MT) metrics are widely used to distinguish the translation qualities of machine translation systems across relatively large test sets (system-level evaluation). However, it is unclear if automatic metrics are reliable at distinguishing good translations from bad translations at the sentence level (segment-level evaluation). In this paper, we investigate how useful MT metrics are at detecting the success of a machine translation component when placed in a larger platform with a downstream task. We evaluate the segment-level performance of the most widely used MT metrics (chrF, COMET, BERTScore, etc.) on three downstream cross-lingual tasks (dialogue state tracking, question answering, and semantic parsing). For each task, we only have access to a monolingual task-specific model. We calculate the correlation between the metric's ability to predict a good/bad translation with the success/failure on the final task for the Translate-Test setup. Our experiments demonstrate that all metrics exhibit negligible correlation with the extrinsic evaluation of the downstream outcomes. We also find that the scores provided by neural metrics are not interpretable mostly because of undefined ranges. Our analysis suggests that future MT metrics be designed to produce error labels rather than scores to facilitate extrinsic evaluation.

translated by 谷歌翻译

Eff-3DPSeg: 3D organ-level plant shoot segmentation using annotation-efficient point clouds

Liyi Luo , Xintong Jiang , Yu Yang , Eugene Roy Antony Samy , Mark Lefsrud , Valerio Hoyos-Villegas , Shangpeng Sun

分类：计算机视觉 | 人工智能

2022-12-20

Reliable and automated 3D plant shoot segmentation is a core prerequisite for the extraction of plant phenotypic traits at the organ level. Combining deep learning and point clouds can provide effective ways to address the challenge. However, fully supervised deep learning methods require datasets to be point-wise annotated, which is extremely expensive and time-consuming. In our work, we proposed a novel weakly supervised framework, Eff-3DPSeg, for 3D plant shoot segmentation. First, high-resolution point clouds of soybean were reconstructed using a low-cost photogrammetry system, and the Meshlab-based Plant Annotator was developed for plant point cloud annotation. Second, a weakly-supervised deep learning method was proposed for plant organ segmentation. The method contained: (1) Pretraining a self-supervised network using Viewpoint Bottleneck loss to learn meaningful intrinsic structure representation from the raw point clouds; (2) Fine-tuning the pre-trained model with about only 0.5% points being annotated to implement plant organ segmentation. After, three phenotypic traits (stem diameter, leaf width, and leaf length) were extracted. To test the generality of the proposed method, the public dataset Pheno4D was included in this study. Experimental results showed that the weakly-supervised network obtained similar segmentation performance compared with the fully-supervised setting. Our method achieved 95.1%, 96.6%, 95.8% and 92.2% in the Precision, Recall, F1-score, and mIoU for stem leaf segmentation and 53%, 62.8% and 70.3% in the AP, AP@25, and AP@50 for leaf instance segmentation. This study provides an effective way for characterizing 3D plant architecture, which will become useful for plant breeders to enhance selection processes.

translated by 谷歌翻译

True Detective: A Challenging Benchmark for Deep Abductive Reasoning \\in Foundation Models

Maksym Del , Mark Fishel

分类：自然语言处理

2022-12-20

Large language models (LLMs) have demonstrated strong performance in zero-shot reasoning tasks, including abductive reasoning. This is reflected in their ability to perform well on current benchmarks in this area. However, to truly test the limits of LLMs in abductive reasoning, a more challenging benchmark is needed. In this paper, we present such a benchmark, consisting of 191 long-form mystery stories, each approximately 1200 words in length and presented in the form of detective puzzles. Each puzzle includes a multiple-choice question for evaluation sourced from the "5 Minute Mystery" platform. Our results show that state-of-the-art GPT models perform significantly worse than human solvers on this benchmark, with an accuracy of 28\% compared to 47\% for humans. This indicates that there is still a significant gap in the abductive reasoning abilities of LLMs and highlights the need for further research in this area. Our work provides a challenging benchmark for future studies on reasoning in language models and contributes to a better understanding of the limits of LLMs' abilities.

translated by 谷歌翻译